Arabic handwritten text recognition has long been a difficult problem, owing 
to the similarity of its characters and the wide range of writing styles. 
Because of the intricate morphology of Arabic handwriting, cursive 
handwriting recognition remains a challenge. In this paper, we propose an 
efficient image-processing-based approach that combines three image 
descriptors in the feature extraction phase. To prepare the training and 
testing datasets, we applied a series of preprocessing techniques to 100 
classes selected from the handwritten Arabic database of the Institut Für 
Nachrichtentechnik/Ecole Nationale d'Ingénieurs de Tunis (IFN/ENIT). Then, 
we trained the k-nearest neighbors (k-NN) algorithm to generate the best 
model for each feature extraction descriptor. The best k-NN model, according 
to common performance evaluation metrics, is used to classify Arabic 
handwritten images according to their classes. Based on the performance 
evaluation results of the three generated k-NN models, the majority-voting 
algorithm is used to combine the prediction results. A recognition rate of 
up to 99.88% is achieved, exceeding the state-of-the-art results reported on 
the IFN/ENIT dataset. The obtained results highlight the reliability of the 
proposed system for the recognition of handwritten Arabic words. 
1. INTRODUCTION 

The objective of handwriting recognition is to transform a character, word, or text into a symbolic 
representation that can be easily processed by a machine. Many applications can take advantage of the 
progress made in the field of automatic handwriting recognition [1], such as reading postal or bank checks, 
reading postal addresses, office automation, and automatic correction of exam papers. Compared with other 
scripts, few works have addressed the recognition of Arabic handwriting, although more than 300 million 
people in more than 20 countries use the Arabic language. In addition, handwritten Arabic characters are 
used in many historical documents. The complexity of the Arabic script and its cursive nature make it a very 
broad field of study, in which great strides have been made. 

The variety of Arabic writing styles and the complexity of letter morphology make selecting and 
describing extracted features more difficult. In practice, there are two major approaches to handwriting 
recognition. The analytical approach is based on word segmentation [2]. This method divides the 
handwritten text into graphemes, which are the elementary parts of the letters [3]. For large vocabularies, 
the analytical technique is the only viable solution. The global approach, on the other hand, is based on a 
unique description of the word as an undivided unit. For limited vocabularies, this method is reported to be 
more successful than the analytical one because it allows for the development of more interesting 
recognition systems [4]. 

In recent years, many researchers have attempted to create systems for recognizing Arabic words 
[5]-[7]. Almodfer et al. [8] built a handwritten Arabic recognition system using deep multi-column neural 
networks (DMNN). This system used the Institut Für Nachrichtentechnik/Ecole Nationale d'Ingénieurs de 
Tunis (IFN/ENIT) database and reached an accuracy of 91.5% using a DMNN with three columns. In another 
study, Parvez and Mahmoud [9] developed a system that recognizes Arabic handwriting using several 
approaches. This system is based on the text's structural features as well as the syntactic relationships 
between the various elements of an Arabic word. Using the whole IFN/ENIT database, a recognition rate of 
79% was achieved. Recently, Alyahya et al. [10] created four deep learning models. The first two models 
employed one fully connected layer after the convolutional layers, with or without a dropout layer. The 
other two models used two fully connected layers, again with or without a dropout layer. The authors used 
the Arabic handwritten characters dataset (AHCD) to train and validate the convolutional neural network 
(CNN)-based ResNet-18 model. The highest accuracy achieved was 98.30%. Alalshekmubarak et al. [11] 
proposed a recognition system that extracts grid features and classifies them using a support vector machine 
(SVM) with a normalized polynomial kernel. The authors used the IFN/ENIT database to evaluate 
performance. The experiments show that the suggested system achieved a recognition accuracy of 95.27% on 
a subset of 24 classes with 7,971 instances, and 92.34% on 56 classes with 12,217 instances. 

A comparison of these studies shows that most of the proposed systems face problems in the 
feature extraction step, which in turn affects their results. To overcome this problem, we use in the feature 
extraction phase a selective method based on three image descriptors to increase the accuracy of our cursive 
Arabic word recognition system. 

This paper uses the global recognition approach to extract primitives from handwritten Arabic word 
images. The aim is to build a new prediction system capable of recognizing a vocabulary of handwritten 
Arabic words with an improved recognition rate [12], [13]. We develop an effective selection approach based 
on three image feature descriptors: the histogram of oriented gradients (HOG), the Gabor filter (GF), and the 
local binary pattern (LBP). The proposed recognition system selects the optimal descriptor among the three 
to make an accurate prediction. Each descriptor is evaluated with a well-known machine learning (ML) 
algorithm, the k-nearest neighbors (k-NN) algorithm. The best generated model is obtained using the 
majority voting technique, which combines and compares the performance of the three k-NN models related 
to the three feature descriptors. The training and testing process was carried out using the handwritten 
Arabic IFN/ENIT database. 

The rest of this paper is structured as follows: section 2 presents the architecture of the proposed 
handwriting recognition system. Section 3 describes the dataset, the feature extraction methods, and the 
combination method used. Experimental results and discussion are given in section 4. In section 5, we 
summarize our study and present some perspectives on this work. 


2. PROPOSED RECOGNITION SYSTEM ARCHITECTURE 

First, we explore the IFN/ENIT dataset and extract the 100 classes with the highest number of 
occurrences. We then perform a variety of preprocessing operations on the word images to increase 
prediction accuracy. The main purpose is to reduce the noise superimposed on the images, because the 
reliability of the recognition is closely linked to the quality of the word to be processed and to the absence 
of noise [14], [15]. In this work, we apply a normalization procedure that resizes word images to a standard 
size of 40×150 pixels and reduces all forms of variance. All images are converted to binary format, in which 
a pixel can only take a black or a white value. Moreover, we apply a smoothing algorithm to increase image 
quality. This technique examines the neighborhood of a pixel and assigns the pixel a value of 1 if the 
number of black pixels in this area exceeds a threshold. In addition, we apply skeletonization to reduce the 
width of the strokes. This process is performed on all handwritten word images [16]. Figure 1 illustrates the 
overall diagram of the modeled system. We divided the entire dataset into two parts, with 80% of the data 
used for training and 20% for testing. 

The next step, called encoding, is to express the primitives in a numerical or symbolic form. The 
feature extraction phase is one of the most critical steps in any recognition system. It involves extracting 
significant information from an image of a specific class to differentiate it more clearly from other classes. 
The HOG, the Gabor filter, and the LBP technique were employed as descriptors. The values of the extracted 
information can be real, integer, or binary, depending on the context. A feature vector summarizes the 
information contained in the entire word image as a numerical description. 
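
The preprocessing chain described above can be sketched in a few lines of pure Python. This is only an illustrative sketch, not the implementation used in this work: the function names, the gray-level threshold of 128, the 3×3 neighborhood, the `min_black` threshold, and the choice of setting a pixel to 0 when the threshold is not exceeded are all assumptions, and skeletonization is omitted.

```python
def resize_nearest(img, out_h=40, out_w=150):
    """Normalize an image (list of rows) to out_h x out_w by nearest-neighbor sampling."""
    h, w = len(img), len(img[0])
    return [[img[y * h // out_h][x * w // out_w] for x in range(out_w)]
            for y in range(out_h)]


def binarize(gray, threshold=128):
    """Map 0-255 gray levels to 1 (black ink) or 0 (white background)."""
    return [[1 if px < threshold else 0 for px in row] for row in gray]


def smooth(binary, min_black=4):
    """Set a pixel to 1 if its 3x3 neighborhood holds more than min_black ink pixels.

    Assumption: the pixel is set to 0 otherwise (the text only states the first case).
    """
    h, w = len(binary), len(binary[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            count = sum(binary[yy][xx]
                        for yy in range(max(0, y - 1), min(h, y + 2))
                        for xx in range(max(0, x - 1), min(w, x + 2)))
            out[y][x] = 1 if count > min_black else 0
    return out


def preprocess(gray):
    """Normalization, binarization, then smoothing (skeletonization not shown)."""
    return smooth(binarize(resize_nearest(gray)))
```

With these assumed settings, an isolated ink pixel is removed and a one-pixel hole inside a stroke is filled; very thin strokes would require a lower `min_black`.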


[Figure 1 flowchart: train database and test database; preprocessing (normalization, smoothing, noise 
removal, skeletonization); features extraction (HOG, LBP method, Gabor filter); features matrix (e.g., 
12490*2448); k-NN training; selection of the efficient model by classifier combination; prediction and 
evaluation] 


Figure 1. Model construction overview 


In this work, we used the features extracted by the three studied descriptors. The HOG descriptor 
generates a matrix of 6,047 images, each represented by 2,448 values. The Gabor filter provides a matrix of 
6,047 images, each represented by 15,200 values. The LBP method converts each image into a vector of 
3,540 values. These three feature extraction methods were chosen to improve the final precision of the 
proposed system. Each of the implemented descriptors uses a particular architecture to generate a feature 
vector, and the resulting diversity of feature vectors is expected to increase the efficiency of the prediction 
system. Indeed, the feature extraction phase has an indirect impact on the learning model's performance. 
The results section demonstrates the benefit of this choice. The vectors obtained by the descriptors represent 
the discriminating and relevant features of each image. In the next step, we used these features to build and 
train three models with the k-NN algorithm. This training step makes it possible to compare the features of 
an image with the references. For each test input, the algorithm characterizes the influence of neighboring 
values on the target prediction: it finds the k closest neighbors according to a metric that determines how 
similar the inputs are. The k-NN algorithm is tuned using optimization methods to determine the optimal 
hyperparameters. Then, we used the majority voting method to combine the classification results obtained 
by the three models. Finally, the system predicts the class given by the majority voting method. 
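
As an illustration of the k-NN decision rule described above, the following minimal sketch ranks the training vectors by distance to a query and returns the most frequent label among the k closest. The function names and the toy data are hypothetical; the models in this work were built in MATLAB, not with this code.

```python
import math
from collections import Counter


def euclidean(a, b):
    """Straight-line distance between two feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def cityblock(a, b):
    """Sum of absolute coordinate differences (L1 distance)."""
    return sum(abs(x - y) for x, y in zip(a, b))


def knn_predict(train_x, train_y, query, k=5, distance=euclidean):
    """Return the majority label among the k training vectors closest to query."""
    ranked = sorted(zip(train_x, train_y),
                    key=lambda pair: distance(pair[0], query))
    votes = Counter(label for _, label in ranked[:k])
    # With equal counts, the label met first (the nearest neighbor's) wins.
    return votes.most_common(1)[0][0]


# Hypothetical 2-D feature vectors for two word classes.
train_x = [[0, 0], [0, 1], [1, 0], [5, 5], [5, 6], [6, 5]]
train_y = ["a", "a", "a", "b", "b", "b"]
label = knn_predict(train_x, train_y, [0.2, 0.1], k=3)
```

In the actual system each vector would hold the 2,448, 15,200, or 3,540 descriptor values of one word image rather than two coordinates.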
3. MATERIALS AND METHODS 
3.1. Dataset description 

The current version 2.0 of the IFN/ENIT dataset is a handwritten Arabic database of Tunisian city 
names. This dataset contains 32,492 images of 937 Tunisian city names provided by over 1,000 writers. The 
database is small in comparison to databases for other languages. For example, the IAM handwriting 
database for the English language comprises over 100,000 instances. The number of occurrences of each 
city name varies considerably [17]. This variability influences the recognition rate of any system. In this 
research, we identified 100 classes with occurrences ranging from 60 to 350. The resulting dataset comprises 
15,613 images from 100 different classes. We divided it into two parts: 12,490 images for training (80%) 
and 3,123 images for testing and validation (20%). Figure 2 illustrates sample images from the IFN/ENIT 
dataset. 




Figure 2. Sample word images from the IFN/ENIT database 


3.2. Feature extraction methods 
3.2.1. Oriented gradient histogram 

The selection of a descriptor is critical to the success of a classification model. The descriptor must 
be representative, discriminating, and fast to compute in order to support accurate prediction [18]. The basic 
principle behind the histogram of oriented gradients is that the distribution of gradient intensities or contour 
directions can describe the local shape and appearance of an object in an image [19]. The HOG descriptor is 
computed by splitting the image into small connected regions called cells, and then calculating a histogram 
of the gradient directions or edge orientations for the pixels in each cell. The descriptor is the combination 
of these histograms. 
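
The cell-histogram principle can be sketched as follows. This is a simplified illustration under stated assumptions, not the HOG configuration used in this work: the cell size of 8 pixels, the 9 unsigned orientation bins, and the function name `hog_cells` are assumed, and the block normalization of the full HOG descriptor [19] is left out.

```python
import math


def hog_cells(img, cell=8, bins=9):
    """Concatenate per-cell histograms of unsigned gradient orientation (0-180 degrees).

    Each pixel votes for its orientation bin with a weight equal to its gradient magnitude.
    """
    h, w = len(img), len(img[0])
    features = []
    for cy in range(0, h - cell + 1, cell):
        for cx in range(0, w - cell + 1, cell):
            hist = [0.0] * bins
            for y in range(cy, cy + cell):
                for x in range(cx, cx + cell):
                    # Central differences, clamped at the image border.
                    gx = img[y][min(x + 1, w - 1)] - img[y][max(x - 1, 0)]
                    gy = img[min(y + 1, h - 1)][x] - img[max(y - 1, 0)][x]
                    magnitude = math.hypot(gx, gy)
                    angle = math.degrees(math.atan2(gy, gx)) % 180.0
                    hist[int(angle / (180.0 / bins)) % bins] += magnitude
            features.extend(hist)
    return features


# A vertical edge: left half white (0), right half ink (1).
edge = [[0] * 8 + [1] * 8 for _ in range(8)]
descriptor = hog_cells(edge)
```

For this toy edge image, all gradient energy falls into the first orientation bin of each of the two cells, since the intensity only changes along the horizontal axis.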


3.2.2. Gabor filter 

A Gabor filter (GF) is a sinusoidal function modulated by a Gaussian envelope. The sinusoidal 
function is characterized by its direction and frequency. Furthermore, a GF can be thought of as a detector 
of edges with a specific orientation: it responds to edges perpendicular to the propagation direction of the 
sinusoid [20]. In two dimensions, this sinusoidal function is the sum of two components, the first even and 
real, and the second odd and imaginary. We employed two functions: the Gabor filter bank and the Gabor 
features. The Gabor filter bank function constructs a u-by-v cell array containing a bank of custom Gabor 
filters (the Gabor array), which is then passed as an argument to the Gabor features function. The Gabor 
filter bank extracts texture-related information from the examined image in terms of both space and 
frequency. 
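
To make the structure of a Gabor filter concrete, the sketch below builds the real (even, cosine) and imaginary (odd, sine) parts of one kernel and arranges several kernels in a u-by-v bank. All parameter names and default values (kernel size, wavelength, sigma, aspect ratio gamma, two scales, four orientations) are illustrative assumptions, not the settings used in this work.

```python
import math


def gabor_kernel(size=7, wavelength=4.0, theta=0.0, sigma=2.0, gamma=0.5):
    """Return (real, imag) kernels: Gaussian envelope times cosine / sine carrier."""
    half = size // 2
    real, imag = [], []
    for y in range(-half, half + 1):
        real_row, imag_row = [], []
        for x in range(-half, half + 1):
            # Rotate coordinates so the carrier propagates along direction theta.
            xr = x * math.cos(theta) + y * math.sin(theta)
            yr = -x * math.sin(theta) + y * math.cos(theta)
            envelope = math.exp(-(xr ** 2 + (gamma * yr) ** 2) / (2 * sigma ** 2))
            phase = 2 * math.pi * xr / wavelength
            real_row.append(envelope * math.cos(phase))
            imag_row.append(envelope * math.sin(phase))
        real.append(real_row)
        imag.append(imag_row)
    return real, imag


def gabor_bank(scales=(4.0, 8.0), orientations=4, size=7):
    """Build a u-by-v table of kernels: u wavelengths (scales) by v orientations."""
    return [[gabor_kernel(size, wl, v * math.pi / orientations)
             for v in range(orientations)]
            for wl in scales]


real, imag = gabor_kernel()
bank = gabor_bank()
```

Feature extraction would then convolve the word image with every kernel of the bank and concatenate the (possibly downsampled) response magnitudes into one vector.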


3.2.3. Local binary pattern 

The LBP has been frequently used to analyze image texture [21], [22]. It is a basic but efficient 
texture operator that labels the pixels of an image according to their surroundings and is insensitive to 
monotonic changes in illumination. Each label is represented by a binary index. The LBP was extended in 
[23] to allow the use of neighborhoods of varied sizes. This allows us to compare the central pixel to P 
evenly spaced points located on a circle of radius R centered on that pixel. The gray level values of these P 
points are bilinearly interpolated, allowing us to use any combination of radius and number of points in the 
neighborhood. 

In an image, we visit each pixel (x, y) and select P neighboring pixels at radius R. We then 
compute the intensity difference between each of the P neighboring pixels and the current pixel. These 
differences are thresholded by mapping all negative differences to zero and all non-negative differences to 
one, resulting in a bit vector. Finally, we convert the bit vector to its decimal value and replace the pixel 
intensity value at (x, y) with it. Equation (1) gives the LBP description for each pixel: 


$$LBP_{P,R}(x_c, y_c) = \sum_{n=0}^{P-1} S(g_n - g_c)\, 2^{n} \qquad (1)$$


where $g_c$ and $g_n$ denote the gray-level intensities of the current central pixel and of the neighboring 
pixels, respectively [24]. P is the number of neighboring pixels chosen at radius R, and the function S(x) is 
defined as (2): 


$$S(x) = \begin{cases} 1 & \text{if } x \geq 0 \\ 0 & \text{if } x < 0 \end{cases} \qquad (2)$$
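
Equations (1) and (2) translate directly into code. The sketch below assumes the simplest setting, P = 8 and R = 1, where the neighbors fall exactly on the 3×3 grid so that no bilinear interpolation is needed; the neighbor ordering, the function names, and the 256-bin histogram used as the feature vector are assumptions made for illustration.

```python
# Offsets (dy, dx) of the P = 8 neighbors at radius R = 1, indexed n = 0..7.
NEIGHBORS = [(-1, -1), (-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1)]


def s(x):
    """Threshold function S(x) of equation (2)."""
    return 1 if x >= 0 else 0


def lbp_code(img, y, x):
    """LBP value of equation (1) for the pixel at row y, column x."""
    g_c = img[y][x]
    return sum(s(img[y + dy][x + dx] - g_c) << n
               for n, (dy, dx) in enumerate(NEIGHBORS))


def lbp_histogram(img):
    """Count how often each of the 256 possible codes occurs over interior pixels."""
    hist = [0] * 256
    for y in range(1, len(img) - 1):
        for x in range(1, len(img[0]) - 1):
            hist[lbp_code(img, y, x)] += 1
    return hist


flat = [[5, 5, 5], [5, 5, 5], [5, 5, 5]]
code = lbp_code(flat, 1, 1)  # every difference is 0, so every bit is set
```

In a flat region all eight differences are zero, so all bits are set and the code is 255; a pixel brighter than all of its neighbors receives code 0.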
3.3. Majority voting method 

Humans cannot identify a texture from a single contact. A person who is given a material to 
categorize will explore the surface several times before making a decision. We employ a majority voting 
approach that replicates this behavior; more importantly, majority voting significantly enhances the 
robustness of the classifier, as shown by the experimental findings. The application of the majority voting 
technique is described in Algorithm 1. 


Algorithm 1. Majority voting method 


Begin 
  Max ← 0; 
  for i ← 1 to C 
    S ← 0; 
    for j ← 1 to L 
      S ← S + T[i, j]; 
    end for 
    if S > Max then 
      Max ← S; 
      Class ← i; 
    end if 
  end for 
  Return Class; 
End 


The simplicity and good performance of majority voting have attracted a lot of attention in the 
literature on classifier combination. A simple analytical justification for majority voting is given by the 
well-known Condorcet jury theorem [25]. This technique is one of the easiest ways to combine the 
predictions of multiple machine learning algorithms. The combination uses the most probable class given 
by each classifier, and the most reliable final class is the one supported by the majority of classifiers [26]. 


If we denote by $s_i^j$ the score of class i given by classifier j, then the most reliable final class is: 


$$Class = \arg\max_{i = 1, \dots, C} \sum_{j=1}^{L} s_i^j \qquad (3)$$


where C represents the number of classes, and L represents the number of classifiers. 
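
Algorithm 1 and equation (3) can be written compactly in Python. In this sketch the table `scores[i][j]` plays the role of T[i, j]; the helper that turns hard predictions into a 0/1 table and the example predictions are hypothetical, and class indices start at 0 rather than 1.

```python
def majority_vote(scores):
    """scores[i][j] is the score of class i given by classifier j.

    Returns the index of the class with the largest summed score; as in
    Algorithm 1, the lowest index wins when several classes tie.
    """
    totals = [sum(row) for row in scores]
    return max(range(len(totals)), key=totals.__getitem__)


def votes_from_predictions(predictions, num_classes):
    """Build the C x L table from hard labels: 1 if classifier j predicts class i."""
    return [[1 if p == i else 0 for p in predictions] for i in range(num_classes)]


# Hypothetical outputs of the HOG, GF, and LBP models for one word image.
table = votes_from_predictions([2, 0, 2], num_classes=3)
winner = majority_vote(table)
```

Here two of the three classifiers agree on class 2, so the combined prediction is class 2 even though one model disagrees.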


4. RESULTS AND DISCUSSION 
4.1. Experimental setup and algorithms best configuration 

The experimental simulations and models were developed in the MATLAB R2020b programming 
environment. The experimental computer is equipped with an Intel i7-8550U CPU (2.00 GHz turbo), 16 GB 
of RAM, and a 512 GB SSD. In this subsection, we present the findings of the three classifiers based on the 
HOG, LBP, and Gabor filter descriptors. We extracted the 100 classes with the highest numbers of 
occurrences from the IFN/ENIT dataset. The resulting dataset was separated into two parts: training data 
(12,490 images) and test data (3,123 images). All images were normalized to the same size of 40×150 pixels. 
We tuned the k-NN classifier's hyperparameters to reduce the cross-validation loss. This is done by 
optimizing the following hyperparameters: the number of nearest neighbors (from 1 to 30) and the distance 
function. Figures 3 to 5 show the curves obtained by the optimization function when searching for the best 
hyperparameters of the three models studied. 
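
The search over the number of neighbors and the distance function can be illustrated with a plain exhaustive grid search, which is swapped in here for the MATLAB optimization function used in this work. The fold count, the two candidate distances, the function names, and the toy samples are all assumptions.

```python
import math
from collections import Counter


def cityblock(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))


def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def knn_label(train, query, k, dist):
    """Majority label among the k samples of train closest to query."""
    nearest = sorted(train, key=lambda item: dist(item[0], query))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


def cv_loss(samples, k, dist, folds=5):
    """Fraction of misclassified samples under folds-fold cross-validation."""
    errors = 0
    for f in range(folds):
        held_out = samples[f::folds]
        train = [s for i, s in enumerate(samples) if i % folds != f]
        errors += sum(knn_label(train, x, k, dist) != y for x, y in held_out)
    return errors / len(samples)


def grid_search(samples, ks=range(1, 31), dists=(cityblock, euclidean)):
    """Return (loss, k, distance name) of the configuration with the lowest loss."""
    return min(((cv_loss(samples, k, d), k, d.__name__) for k in ks for d in dists),
               key=lambda t: t[0])


# Hypothetical, well-separated two-class data: (feature vector, label) pairs.
samples = ([((float(i), 0.0), "a") for i in range(10)]
           + [((100.0 + i, 0.0), "b") for i in range(10)])
loss, best_k, best_dist = grid_search(samples)
```

An exhaustive grid evaluates every combination, which is affordable for a search space this small; an optimizer that samples the space adaptively reaches a comparable minimum with fewer evaluations.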

Table 1 presents the best hyperparameters found for the three studied classifiers, together with the 
loss and the execution time. For each model, the table lists more than one well-performing combination. 
Some combinations require more execution time to reach a minimum loss rate; this is particularly 
noticeable for the models based on the Gabor filter and LBP descriptors. In this work, the hyperparameters 
are chosen according to the minimum loss rate and not according to the execution time. 


4.2. Criteria for evaluation 

In general, the confusion matrix is used to evaluate the performance of the created models. We 
employed the multi-class confusion matrix since the handwritten vocabulary has numerous classes. The 
confusion matrix is built by computing the following elements: true positive (TP), true negative (TN), false 
positive (FP), and false negative (FN). The class predicted by the classifier is the predicted output, while the 
class label supplied in the dataset is the actual output. Because our confusion matrices are multi-class, we 
must calculate the TP, FP, FN, and TN values for each class in the matrix. In this investigation, four score 
criteria were used: accuracy, precision, sensitivity, and specificity. Equations (4)-(7) define these metrics: 


$$Accuracy = \frac{TP + TN}{TP + TN + FP + FN} \qquad (4)$$

$$Precision = \frac{TP}{TP + FP} \qquad (5)$$

$$Sensitivity = \frac{TP}{TP + FN} \qquad (6)$$

$$Specificity = \frac{TN}{TN + FP} \qquad (7)$$
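
The per-class computation of TP, FP, FN, and TN from a multi-class confusion matrix, followed by equations (4)-(7), can be sketched as below. The layout `cm[actual][predicted]`, the function names, and the 3-class example matrix are assumptions for illustration; the sketch does not guard against empty classes (zero denominators).

```python
def per_class_counts(cm, c):
    """cm[actual][predicted] -> (TP, FP, FN, TN) for class index c, one-vs-rest."""
    total = sum(sum(row) for row in cm)
    tp = cm[c][c]
    fn = sum(cm[c]) - tp                 # class c samples predicted as another class
    fp = sum(row[c] for row in cm) - tp  # other samples predicted as class c
    tn = total - tp - fn - fp
    return tp, fp, fn, tn


def metrics(tp, fp, fn, tn):
    """Accuracy, precision, sensitivity, and specificity as in equations (4)-(7)."""
    return {
        "accuracy": (tp + tn) / (tp + tn + fp + fn),
        "precision": tp / (tp + fp),
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
    }


# Hypothetical 3-class confusion matrix (22 test samples).
cm = [[5, 1, 0],
      [2, 6, 0],
      [0, 1, 7]]
m = metrics(*per_class_counts(cm, 0))
```

Because TN counts every sample of every other class, the one-vs-rest accuracy of equation (4) grows with the number of classes, which is why it can be much higher than the sensitivity for the same model.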
Figure 5. Evaluation function diagram of k-NN model classifier using LBP method 


Table 1. The best configurations obtained by the classifiers using objective function 


Models       Loss     Runtime (seconds)   Neighbors   Distance 
HOG model    0.1093   130.51              5           correlation 
HOG model    0.1181   213.11              3           correlation 
HOG model    0.1268   216.34              3           minkowski 
GF model     0.0607   899.37              7           correlation 
GF model     0.0673   863.75              3           minkowski 
LBP model    0.1286   306.84              5           cityblock 
LBP model    0.1399   306.9               1           cityblock 
LBP model    0.1532   307.76              5           minkowski 
LBP model    0.1616   288.82              3           minkowski 
4.3. Testing results 

In this subsection, we present the testing results of the three feature extraction methods used to 
build the final prediction model. The k-NN algorithm was used in the training step to build the three tuned 
models, using the optimal hyperparameter combinations defined in the previous subsection. Each model is 
fitted to the training data extracted by one of the studied image descriptors, namely the HOG, LBP, and GF 
descriptors. The results obtained during the classification process are represented using a confusion matrix, 
and each of the three models provided its own classification of the images. Because it is impractical to plot 
three confusion matrices of 100 by 100 elements, we have summarized these matrices in one table, which 
groups together all the essential elements of a confusion matrix. Based on these matrices, we evaluated the 
performance of the trained models by computing five measures: accuracy, sensitivity, specificity, precision, 
and loss value. The testing results show a difference in performance between the four generated models. 
The accuracy and error rate of the three basic models range from 99.69% to 99.85% and from 6.07% to 
12.86%, respectively. The hybrid model achieves the best performance, with a maximum accuracy of 
99.88% and a 95% confidence interval (CI) of [98.67%, 100%]. 


4.4. Discussion 

In this work, we used three image description methods to build a hybrid system for the recognition 
of a limited vocabulary. The IFN/ENIT dataset was used to generate this vocabulary. We picked 100 
different word classes with occurrences ranging from 64 to 368. The quality of the words to be processed 
and the absence of noise are both important factors in ensuring the reliability of the recognition. The 
selected word images were therefore subjected to a range of preprocessing techniques: normalization, 
smoothing, skeletonization, and binarization. We divided the dataset into two parts, with the training set 
accounting for 80% of the dataset and the test set accounting for 20%. In the feature extraction step, we 
used three different description techniques to extract significant features and create a feature vector for 
each image: the HOG, the Gabor filter, and the LBP approach. Each image is described by a vector of 2,448, 
15,200, and 3,540 values using the HOG, GF, and LBP descriptors, respectively. The k-NN technique was 
used to train the three models. By comparing an unknown image to the samples recorded in the reference 
classes, the k-NN classifier assigns it to the class of its nearest neighbors. To determine the best 
combination of parameters for each model, we ran a parameter optimization. A decision strategy makes it 
possible to assign confidence values to each of the competing classes and to assign the most probable class 
to the unknown image. The final model is created by combining the three previously trained models using 
the majority voting approach. The models' performance is assessed using score measures such as loss, 
precision, specificity, and sensitivity. The experimental results demonstrate that all models achieved high 
accuracy with a reduced loss rate. Moreover, these results suggest that a hybrid model based on the feature 
descriptor selection method enhances the prediction performance compared to other models in the 
literature. In Table 2, we compare the performance of the proposed model with some related work studied 
in this article. 

Although the results of this study are good, there are some limitations. The biggest drawback is 
that all of the findings come from a single data source. In addition, the results obtained by the model based 
on the Gabor filter descriptor are the most accurate, but at the cost of a significant increase in computational 
complexity in terms of execution time. This is an anticipated trade-off for increased precision; however, 
further parameter optimization in future work may improve our results not just in terms of precision but 
also in terms of execution time. 


Table 2. Comparisons with some other related work. 


Author                        Used model                          Accuracy   Sensitivity   Specificity   Loss 
Mouhcine et al. [27]          Hidden Markov models                87.93%     N/A           N/A           N/A 
Almodfer et al. [8]           Multi-column deep neural networks   91.50%     N/A           N/A           N/A 
Alalshekmubarak et al. [11]   Support vector machines             92.34%     N/A           N/A           N/A 
Alyahya et al. [10]           Convolutional neural network        98.30%     N/A           N/A           N/A 
Balaha et al. [28]            Convolutional neural network        90.7%      N/A           N/A           N/A 
Present work                  k-NN + Majority voting              99.88%     97.90%        99.86%        1.18% 


5. CONCLUSION AND PERSPECTIVES 

In this study, we presented a system architecture based on a new hybridization approach for 
recognizing a vocabulary of handwritten Arabic words. The primary goal was to increase recognition 
accuracy and create a more efficient recognition system. A vocabulary of 100 classes was chosen from the 
IFN/ENIT dataset. The generated model combines three models based on the HOG, GF, and LBP 
descriptors. This model reached a recognition rate of 99.88% and a loss rate of 1.18%. A comparison of 
these findings with current research on a publicly available dataset supports the performance and reliability 
of the generated model for recognizing a vocabulary of handwritten Arabic words. In future work, we aim 
to design a hybrid architecture that uses convolutional neural networks for the feature extraction phase and 
the hidden Markov model (HMM) for the classification step. 


REFERENCES 

[1] H. Q. Ghadhban, M. Othman, N. A. Samsudin, M. N. Bin Ismail, and M. R. Hammoodi, “Survey of offline Arabic handwriting 
word recognition,” in Recent Advances on Soft Computing and Data Mining, 2020, pp. 358-372. 

[2] A. Zoizou, A. Zarghili, and I. Chaker, “A new hybrid method for Arabic multi-font text segmentation, and a reference corpus 
construction,” Journal of King Saud University - Computer and Information Sciences, vol. 32, no. 5, pp. 576-582, Jun. 2020, doi: 
10.1016/j.jksuci.2018.07.003. 

[3] H. Bouressace and J. Csirik, “A self-organizing feature map for Arabic word extraction,” in Text, Speech, and Dialogue, 2019, 
pp. 127-136. 

[4] A. Al-Saffar, S. Awang, W. AL-Saiagh, S. Tiun, and A. S. Al-khaleefa, “Deep learning algorithms for Arabic handwriting 
recognition: A review,” International Journal of Engineering & Technology, vol. 7, no. 3.20, 2018, doi: 
10.14419/ijet.v7i3.20.19271. 

[5] N. Alrobah and S. Albahli, “A hybrid deep model for recognizing Arabic handwritten characters,” IEEE Access, vol. 9, 
pp. 87058-87069, 2021, doi: 10.1109/ACCESS.2021.3087647. 

[6] N. Altwaijry and I. Al-Turaiki, “Arabic handwriting recognition system using convolutional neural network,” Neural Computing 
and Applications, vol. 33, no. 7, pp. 2249-2261, Apr. 2021, doi: 10.1007/s00521-020-05070-8. 

[7] S. P. Deore and A. Pravin, “Ensembling: Model of histogram of oriented gradient based handwritten devanagari character 
recognition system,” Traitement du signal, vol. 34, no. 1-2, pp. 7-20, Oct. 2017, doi: 10.3166/ts.34.7-20. 

[8] R. Almodfer, S. Xiong, M. Mudhsh, and P. Duan, “Multi-column deep neural network for offline Arabic handwriting 
recognition,” in Artificial Neural Networks and Machine Learning, 2017, pp. 260-267. 

[9] M. T. Parvez and S. A. Mahmoud, “Arabic handwriting recognition using structural and syntactic pattern attributes,” Pattern 
Recognition, vol. 46, no. 1, pp. 141-154, Jan. 2013, doi: 10.1016/j.patcog.2012.07.012. 

[10] H. Alyahya, M. M. Ben Ismail, and A. Al-Salman, “Deep ensemble neural networks for recognizing isolated Arabic handwritten 
characters,” ACCENTS Transactions on Image Processing and Computer Vision, vol. 6, no. 21, pp. 68-79, Nov. 2020, doi: 
10.19101/TIPCV.2020.618051. 

[11] A. Alalshekmubarak, A. Hussain, and Q.-F. Wang, “Off-Line handwritten Arabic word recognition using SVMs with normalized 
Poly Kernel,” in Neural Information Processing, 2012, pp. 85-91. 

[12] L. M. Lorigo and V. Govindaraju, “Offline Arabic handwriting recognition: a survey,” IEEE Transactions on Pattern Analysis 
and Machine Intelligence, vol. 28, no. 5, pp. 712-724, May 2006, doi: 10.1109/TPAMI.2006.102. 

[13] S. Hamida, B. Cherradi, O. Terrada, A. Raihani, H. Ouajji, and S. Laghmati, “A novel feature extraction system for cursive word 
vocabulary recognition using local features descriptors and Gabor filter,” in 2020 3rd International Conference on Advanced 
Communication Technologies and Networking (CommNet), Sep. 2020, pp. 1-7, doi: 10.1109/CommNet49926.2020.9199642. 

[14] A. T. Sahlol, C. Y. Suen, M. R. Elbasyoni, and A. A. Sallam, “Investigating of preprocessing techniques and novel features in 
recognition of handwritten Arabic characters,” in Advanced Information Systems Engineering, 2014, pp. 264-276. 

[15] S. Hamida, B. Cherradi, O. El Gannour, O. Terrada, A. Raihani, and H. Ouajji, “New database of french computer science Words 
handwritten vocabulary,” in 2021 International Congress of Advanced Technology and Engineering (ICOTEN), Jul. 2021, 
pp. 1-5, doi: 10.1109/ICOTEN52080.2021.9493438. 

[16] K. Saeed, M. Tabedzki, M. Rybnik, and M. Adamski, “K3M: A universal algorithm for image skeletonization and a review of 
thinning techniques,” International Journal of Applied Mathematics and Computer Science, vol. 20, no. 2, pp. 317-335, Jun. 
2010, doi: 10.2478/v10006-010-0024-4. 

[17] H. El Abed and V. Margner, “The IFN/ENIT-database - a tool to develop Arabic handwriting recognition systems,” in 2007 9th 
International Symposium on Signal Processing and Its Applications, Feb. 2007, pp. 1-4, doi: 10.1109/ISSPA.2007.4555529. 

[18] T. Kobayashi, A. Hidaka, and T. Kurita, “Selection of histograms of oriented gradients features for pedestrian detection,” in 
Neural Information Processing, Berlin, Heidelberg: Springer Berlin Heidelberg, 2008, pp. 598-607. 

[19] N. Dalal and B. Triggs, “Histograms of oriented gradients for human detection,” in 2005 IEEE Computer Society Conference on 
Computer Vision and Pattern Recognition (CVPR’05), 2005, vol. 1, pp. 886-893, doi: 10.1109/CVPR.2005.177. 

[20] S. Khan, A. Khan, M. Maqsood, F. Aadil, and M. A. Ghazanfar, “Optimized Gabor feature extraction for mass classification 
using cuckoo search for big data e-healthcare,” Journal of Grid Computing, vol. 17, no. 2, pp. 239-254, Jun. 2019, doi: 
10.1007/s10723-018-9459-x. 

[21] D. Harwood, T. Ojala, M. Pietikäinen, S. Kelman, and L. Davis, “Texture classification by center-symmetric auto-correlation, 
using Kullback discrimination of distributions,” Pattern Recognition Letters, vol. 16, no. 1, pp. 1-10, Jan. 1995, doi: 
10.1016/0167-8655(94)00061-7. 

[22] T. Ojala, M. Pietikäinen, and D. Harwood, “A comparative study of texture measures with classification based on featured 
distributions,” Pattern Recognition, vol. 29, no. 1, pp. 51-59, Jan. 1996, doi: 10.1016/0031-3203(95)00067-4. 

[23] T. Ojala, M. Pietikäinen, and T. Mäenpää, “Multiresolution gray-scale and rotation invariant texture classification with local 
binary patterns,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 24, no. 7, pp. 971-987, Jul. 2002, doi: 
10.1109/TPAMI.2002.1017623. 

[24] D. Huang, C. Shan, M. Ardabilian, Y. Wang, and L. Chen, “Local binary patterns and its application to facial image analysis: A 
survey,” IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), vol. 41, no. 6, pp. 765-781, 
Nov. 2011, doi: 10.1109/TSMCC.2011.2118750. 

[25] P. J. Boland, “Majority systems and the Condorcet jury theorem,” The Statistician, vol. 38, no. 3, 1989, doi: 10.2307/2348873. 

[26] S. Tulyakov, S. Jaeger, V. Govindaraju, and D. Doermann, “Review of classifier combination methods,” in Machine Learning in 
Document Analysis and Recognition, 2008, pp. 361-386. 

[27] R. Mouhcine, A. Mustapha, and M. Zouhir, “Recognition of cursive Arabic handwritten text using embedded training based on 
HMMs,” Journal of Electrical Systems and Information Technology, vol. 5, no. 2, pp. 245-251, Sep. 2018, doi: 
10.1016/j.jesit.2017.02.001. 




[28] H. M. Balaha, H. A. Ali, M. Saraya, and M. Badawy, “A new Arabic handwritten character recognition deep learning system 
(AHCR-DLS),” Neural Computing and Applications, vol. 33, no. 11, pp. 6325-6367, 2021, doi: 10.1007/s00521-020-05397-2. 


BIOGRAPHIES OF AUTHORS 


Soufiane Hamida is 29 years old and from Casablanca, Morocco. He received a 
master’s degree in educational technology in 2017 from the Higher Normal School of Tetouan, 
Abdelmalek Essaadi University. He is currently an active researcher with the Signals, 
Distributed Systems, and AI Research Laboratory, Hassan II University of Casablanca. He has 
been a Ph.D. student since 2017, working on machine learning methodologies applied to 
pattern recognition. He has published and contributed to many research 
works. His research interests include the recognition of handwritten characters and words, 
parameter optimization of machine learning models, and the prediction of heart disease, 
breast cancer, and COVID-19 in patients using artificial intelligence techniques. He can be 
contacted at email: hamida.93s@gmail.com. 


Oussama El Gannour is a 28-year-old Moroccan from Kenitra. In 2018, he 
obtained a Specialized Master’s degree in Internet of Things and Mobile Services from the 
National Higher School of Computer Science and Systems Analysis of Rabat, Mohammed V 
University. Since 2019, he has been a PhD student at the Research Laboratory on Electrical 
Engineering and Intelligent Systems of the Hassan II University of Casablanca. His research 
interests focus on the integration of the internet of things and artificial intelligence in the 
medical field. His published research works include screening patients with heart disease and 
COVID-19 using AI techniques and IoT. He can be contacted at email: 
oussama.elgannour@gmail.com. 


Bouchaib Cherradi was born in 1970 at El Jadida, Morocco. He received the 
B.S. degree in Electronics in 1990 and the M.S. degree in Applied Electronics in 1994 from the 
ENSET Institute of Mohammedia, Morocco. He received the DESA diploma in 
Instrumentation of Measure and Control (IMC) from Chouaib Doukkali University at El Jadida 
in 2004. He received his Ph.D. in Electronics and Image processing from the Faculty of 
Science and Technology, Mohammedia. His research interests include applications of 
massively parallel architectures, cluster analysis, pattern recognition, image processing, fuzzy 
logic systems, artificial intelligence, machine learning and deep learning in medical and 
educational data analysis. Dr. Cherradi currently works as an associate professor at CRMEF-El 
Jadida. In addition, he is an associate researcher member of the Signals, Distributed Systems and 
Artificial Intelligence (SSDIA) Laboratory in ENSET Mohammedia, Hassan II University of 
Casablanca (UH2C), and LaROSERI Laboratory on leave from the Faculty of Science El 
Jadida (Chouaib Doukkali University), Morocco. He is a supervisor of several PhD students. He 
can be contacted at email: bouchaib.cherradi@gmail.com. 


Hassan Ouajji is a full professor (higher education degree) at ENSET Mohammedia 
in the Department of Electronic and Electrical Engineering. He is a researcher member of the SSDIA 
laboratory and head of the Network and Telecommunications team. His research interests 
include acoustics, electrical engineering, signal processing, electronics, networks, and 
telecommunications. He has served as an outstanding reviewer for various indexed journals and on the 
organizing and technical program committees of tens of international conferences. He has published 
tens of refereed journal and conference papers. He is a member of several scientific projects of 
Hassan II University of Casablanca, Morocco. He can be contacted at email: 
ouajji@hotmail.com. 


Abdelhadi Raihani has been a professor in Electronics Engineering at 
Hassan II University of Casablanca, ENSET Institute, Mohammedia, Morocco, since 1991. He 
received the B.S. degree in Electronics in 1987 and the M.S. degree in Applied Electronics in 
1991 from the ENSET Institute. He received his DEA diploma in information processing from 
the Ben M’sik University of Casablanca in 1994. He received the Ph.D. in Parallel 
Architectures Application and image processing from the Ain Chock University of Casablanca 
in 1998. His current research interests are in the medical image processing areas, electrical 
engineering fields, particularly in renewable energy, energy management systems, and power 
and energy systems control. He has more than 46 journal publications and over 60 
international conference papers. He is an active member in national research programs with 
IRESEN under the grant “Green INNO Project/UPISREE”. He has supervised several Ph.D. and 
engineering students in these topics. He can be contacted at email: abraihani@yahoo.fr. 


Int J Elec & Comp Eng, Vol. 12, No. 5, October 2022: 5304-5312 


